Event Schema Induction Based on Relational Co-occurrence over Multiple Documents
Abstract
An event schema, which comprises a set of related events and their participants, has become increasingly important as information extraction (IE) has developed, and inducing event schemas is a prerequisite for IE and natural language generation. In traditional IE tasks, event schemas and slots are usually designed manually. Methods for inducing event schemas automatically have been proposed recently. One of the fundamental assumptions in event schema induction is that related events tend to appear together to describe a scenario in natural-language discourse; however, previous work focused only on co-occurrence within a single document. We find that the co-occurrence of semantically typed relational tuples over multiple documents helps in constructing event schemas. We exploit this co-occurrence by locating the key tuple and counting relational tuples, and we build a co-occurrence graph that takes co-occurrence information over multiple documents into account. Experiments show that co-occurrence information over multiple documents helps to combine similar elements of an event schema and to alleviate incoherence problems.
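The idea of counting relational-tuple co-occurrence over multiple documents can be illustrated with a minimal sketch. This is not the authors' implementation: the tuple format, the sample documents, the function names, and the choice of "most widely attested tuple" as the key tuple are all assumptions made for illustration only.

```python
from collections import Counter
from itertools import combinations

# Hypothetical input: each document is a list of semantically typed
# relational tuples of the form (argument type, relation, argument type).
docs = [
    [("person", "plant", "bomb"),
     ("bomb", "explode in", "location"),
     ("person", "be killed in", "location")],
    [("bomb", "explode in", "location"),
     ("person", "be killed in", "location")],
    [("person", "plant", "bomb"),
     ("bomb", "explode in", "location")],
]

def document_frequency(documents):
    """Count in how many documents each tuple appears."""
    freq = Counter()
    for doc in documents:
        freq.update(set(doc))
    return freq

def build_cooccurrence_graph(documents):
    """Return edge weights: the number of documents in which two tuples co-occur."""
    edges = Counter()
    for doc in documents:
        # Sort the unique tuples so that each unordered pair has one canonical key.
        for a, b in combinations(sorted(set(doc)), 2):
            edges[(a, b)] += 1
    return edges

freq = document_frequency(docs)
# Assumption: treat the tuple attested in the most documents as the "key tuple".
key_tuple = max(freq, key=freq.get)
graph = build_cooccurrence_graph(docs)
```

Here the edge weights accumulate across all documents rather than within one, so pairs of tuples that repeatedly appear together in different reports of the same kind of scenario receive higher weights than pairs seen together only once.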
Similar resources
An Approach to Eliminate Semantic Heterogenity Using Ontologies in Enterprise Data Integeration
XML syntax and semantic validations are critical to the correct service transaction specification and service integration based on existing distributed and heterogeneous computing services. Current industry practice of XSLT-based Schematron validation may produce invalid results, and contributes a reusable XML validator component that supports sound integrated syntax/semantic validations and ev...
Translating Relational Schema with Constraints into Xml Schema
With XML adopted as the technology trend on the Internet, and with their investment in current relational database systems, companies must convert their relational data into XML documents for data transmission on the Internet. In the process, to preserve the users' data-constraint requirements in the converted XML documents, we must define a meaningful root element for each XML ...
Querying XML Documents from a Relational Database in the Presence of DTDs
Many researchers have investigated the problem of storing and querying XML documents using an RDBMS. Two situations are considered in this approach, depending on whether or not an XML schema is available. In a schema-oblivious relational approach, an XML schema is not available, or it is available but is not used. The advantage of the schema-oblivious relational approach is that no XML schema is required...
A DOM-Based Approach of Storage and Retrieval of XML Documents Using Relational Databases
This paper describes a novel approach to the storage and retrieval of XML documents using relational databases. In this approach, an XML document is decomposed into nodes based on its tree structure and stored in relational tables according to the node types. Our approach enables us to store XML documents using a fixed relational schema without any information about the XML schema or DTD. For the...
Rule Learning from Semi-structured Documents by Inductive Logic Programming
One of the hot research areas is knowledge discovery on structured documents such as HTML and XML documents. In the case of XML documents, the most popular approach to mining knowledge is the structural approach, which finds some kind of similar pattern (often a tree structure or XPath) in the XML documents of interest. On the other hand, there is the relational data mining approach, such as ILP (Inductive Logic Progra...